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Summary of Progress 


During the period December 1, 1986 - May 31, 1987, progress was made 
in the following areas: 

1) Concatenated Codes Using Bandwidth Efficient Trellis Inner Codes. 

A paper summarizing our work on the performance of bandwidth effi- 
cient trellis inner codes using two-dimensional MPSK signal constellations 
in a NASA concatenated coding system has been submitted for publication 
to the IEEE Transactions on Communications [1]. These coding schemes 
achieve an effective information rate (information bits transmitted per unit 
bandwidth) at least double that currently used in NASA’s TDRS system. 
They also provide large coding gains, which make them desirable for use in 
high speed satellite communication systems. Implementations of an inner 
coding scheme similar to those proposed in [1] have been demonstrated at 
speeds up to 120 Mbps [2]. 

Using a soft decision Viterbi decoder for the inner code, we have shown 
in [1] that coding gains of 4dB @ 10 -6 and 6dB @ 10 -9 can be achieved 
without bandwidth expansion using 16-state trellis coded 8 PSK modula- 
tion as the inner code and a length 255 Reed-Solomon (RS) outer code. 
With a modified Viterbi decoder for the inner code which passes some era- 
sure symbols to the outer decoder, a decoded bit error rate (BER) of 10 -16 , 
with an erasure rate of 10 -3 , can be achieved at an Eb/N 0 = 5.5 dB and 
only 10 % bandwidth expansion. This paper [1] is included as Appendix A 
of this report. 

We have also continued our work on trellis coded modulation using 
multi-dimensional signal sets. A paper on this subject has been submit- 
ted to the IEEE Transactions on Information Theory [3]. A copy of this 
paper was included in our previous report. A talk on this subject was 
also presented at the 1987 Conference on Information Sciences and Sys- 
tems [4], and the Principal Investigator has been invited to give a lecture 
on multi-dimensional trellis coded modulation at the IEEE Information 
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Theory Workshop, in Bellagio, Italy [5]. 

We are currently preparing two additional papers in this area for pub- 
lication [6,7]. One summarizes our work on multi-dimensional 16 PSK 
code constructions, while the other investigates the performance of multi- 
dimensional trellis coded modulation as inner codes in a concatenated cod- 
ing system. Copies of these papers will be included in a future report. 


2) Bounds on the Minimum Free Euclidean Distance of Bandwidth Efficient 
Trellis Codes. 

On a more theoretical note, a paper summarizing our work on lower 
bounds on the minimum free Euclidean distance of bandwidth efficient 
trellis codes has been submitted to the IEEE Transactions on Information 
Theory [8], and is included as Appendix B of this report. This paper proves 
achievable lower bounds on free distance for trellis codes and establishes the 
existence of good trellis coded modulation (TCM) schemes for a variety 
of signal constellations, including MPSK. These bounds can be used as a 
means of comparing the performance of various TCM schemes, particularly 
at large constraint lengths. 

A weakness of this bound is that it is not tight at small constraint 
lengths, and we are currently working on a method of tightening the bound. 
Two additional papers are being planned, one on a tightened bound and 
the other on the basic structural properties of TCM schemes. 


3) Performance Analysis of Bandwidth Efficient Trellis Codes on Non- 
Uniform Channels 

Communication over real world channels, such as mobile satellite chan- 
nels, suffers from non-uniform disturbances that cannot be described by ad- 
ditive white Gaussian noise. One important such disturbance is the multi- 
plicative noise arising from multipath reception at the receiver [9]. Commu- 
nication channels suffering from multipath are known as Rayleigh/Rician 
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fading channels. 

We have begun an investigation of the performance of TCM schemes on 
fading channels. Our preliminary results indicate that bandwidth efficient 
trellis coding is feasible on such channels, but that the important design 
parameter is no longer the minimum free Euclidean distance. Two new pa- 
rameters, the “effective length” and the “minimum product distance” axe 
more important. Intuitively, a good code in a nonuniform noise environ- 
ment should try to spread the “distance” between two codewords over all 
the branches over which the codewords differ. This is the same idea used 
in diversity transmission, which is usually employed on fading channels. A 
paper summarizing our preliminary results is included as Appendix C [10] 
of this report. 
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Abstract 

High rate concatenated coding systems with bandwidth efficient 
trellis inner codes and Reed-Solomon (RS) outer codes are investi- 
gated for application in high speed satellite communication systems. 
Two concatenated coding schemes are proposed. In Scheme I, the in- 
ner code is decoded with soft-decision Viterbi decoding and the outer 
RS code performs error- correction only decoding (decoding without 
side information). In scheme II, the inner code is decoded with a mod- 
ified Viterbi algorithm which produces reliability information along 
with the decoded output. In this algorithm, path metrics are used 
to estimate the entire information sequence, while branch metrics are 
used to provide reliability information on the decoded sequence. This 
information is used to erase unreliable bits in the decoded output. An 
errors-and-erasures RS decoder is then used for the outer code. These 
two schemes have been proposed for high speed data communication 
on NASA satellite channels. The rates considered are at least double 
those used in current NASA systems, and our results indicate that 
high system reliability can still be achieved. 


’This work was supported by NASA Grant NAG5-557. 
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I Introduction 


To meet the ever-increasing demand for satellite communication and voice- 
band telephone services, the search for bandwidth and power-efficient mod- 
ulation/coding systems has recently become a very active research area 
[1-9]. The basic idea of bandwidth efficient trellis codes, or trellis coded 
modulation (TCM), pioneered by Ungerboeck [1], is that by trellis cod- 
ing onto an expanded modulation set (relative to that needed for uncoded 
transmission) and by designing the trellis codes to maximize the minimum 
free Euclidean distance between allowable code sequences, asymptotic (high 
signal-to-noise ratio) coding gains of 3 ~ 6dB compared with an uncoded 
system can be achieved without bandwidth expansion. The performance of 
TCM systems can be measured in terms of their asymptotic coding gain 7 
over an uncoded system with the same spectral efficiency. The asymptotic 
coding gain increases with the number of states in the trellis. For exam- 
ple, Ungerboeck’s simplest scheme uses a 4-state, rate 2/3 trellis encoder 
with 8-PSK modulation and achieves a 7 = 3 dB coding gain over uncoded 
QPSK modulation without bandwidth expansion. A more complex 16-state 
encoder achieves a 7 = 4.1 dB coding gain. 

Practically, we are more concerned with the real coding gains which can 
be achieved at moderate values of the decoded bit error rate (BER) than 
with the asymptotic coding gain. For satellite communication, decoded 
BER’s in the range of 10~ 6 ~ 10" 9 are reasonable requirements. Channel 
capacity results imply that real coding gains over uncoded QPSK of 8.7 dB 
at a BER of 10~ 6 and 15.5 dB at a BER of 10~ 9 can be achieved [1,2]. 
Ungerboeck’s 4-state code achieves a 3 dB real coding gain over uncoded 
QPSK without bandwidth expansion both at 10~ 6 and 10 -9 BER’s, while 
his 16-state code achieves a 3.7 dB gain at 10~ 6 and a 3.9 dB gain at 
10 -9 . In this paper, we investigate the use of concatenated codes with 
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TCM inner codes and Reed-Solomon (RS) outer codes to achieve larger 
real coding gains over uncoded QPSK at BER’s of 10 -6 ~ 10“ 9 with little 
or no bandwidth expansion. 

Concatenated coding has long been used as a practical means of achiev- 
ing long block or constraint lengths (and thereby achieving large coding 
gains) [10-14]. One such system consists of Viterbi decoding of an (n,l,m) 
convolutional inner code with BPSK modulation concatenated with an 
outer RS code. However, the overall effective information transmission 
rate of this system is less than l/n,n > 2, bit per signal dimension [11, 12]. 
This system achieves increased power efficiency at the expense of decreased 
spectral efficiency. 

In this paper, we present a bandwidth efficient trellis inner code/RS 
outer code concatenated coding system for use in high speed satellite com- 
munication systems. The goal of using bandwidth efficient trellis codes as 
inner codes is to achieve large coding gains over uncoded QPSK without 
bandwidth expansion. We will see in the following sections that such a 
system can provide coding gains from 4 to 8 dB at BER’s of 10~ 6 ~ 10~ 9 
with little or no bandwidth expansion (while the asymptotic coding gain 
can be as large as 20 dB), when uncoded QPSK is used as our reference 
system. 

There are two types of bandwidth efficient trellis codes: lattice-type trel- 
lis coded modulation (e.g., QASK) [1,2, 3, 4, 5] and constant-envelope-type 
trellis coded modulation (e.g., MPSK) [1,6, 7, 8, 9]. The former is more power 
efficient than the latter under an average energy constraint. However, since 
constant-envelope signals mitigate the non-linear effects of TWT amplifiers 
and can be implemented for high speed modem operation [15], trellis coded 
MPSK modulation is the preferred modulation scheme for satellite chan- 
nels. Hence, we will consider the case when trellis coded MPSK modulation 
is used as the inner code. 
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Fig. 1 shows the encoding-decoding block diagram of the concatenated 
coding system. Encoding is performed in two stages. An information se- 
quence of Kb bits is divided into K symbols of b bits each, and each b - bit 
symbol is regarded as an element of GF(2 b ). These K symbols axe used as 
the input to the RS encoder. The output of this encoder is an iV-symbol 
codeword which is symbol-interleaved and then serially encoded by the 
trellis encoder. Decoding is accomplished in the reverse order. The output 
of the maximum-likelihood (Viterbi) trellis decoder contains bursty errors. 
The main purpose of the inner code is to shape the distribution of the errors 
on the inner channel, rather than to correct errors, particularly when the 
inner code rate is around the cutoff rate of the inner channel. When an RS 
outer code is used, this shaping compresses the random errors on the inner 
channel into symbol errors corresponding to the symbol size used by the 
outer code [13]. The use of bandwidth efficient trellis codes as inner codes 
has one important feature: because of the bandwidth efficient property of 
the code, it compensates for the bandwidth expansion introduced by the 
outer RS code, so that the overall system suffers no bandwidth expansion. 

Two concatenated coding schemes axe studied. Scheme I, a Viterbi de- 
coded trellis inner code/RS outer code system, where the Viterbi decoder 
outputs are provided to the outer decoder without side information, is stud- 
ied in Section II. Code performance results for several examples of scheme 
I are shown in figures 2-3 sold tables 1-2. Asymptotic coding gains axe also 
determined and presented in figures 4-6. A modified Viterbi decoding algo- 
rithm for trellis codes with output reliability information is introduced in 
Section III. Scheme II, a concatenated coding system with side information 
available to the outer decoder, is then studied in Section IV. In the mod- 
ified algorithm, path metrics axe used to estimate the entire information 
sequence, and branch metrics axe used to provide reliability information on 
the decoded sequence. This is done by erasing a certain number of bits 
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on the survivor of a compaxison between the two most likely paths whose 
metrics axe within some fixed threshold of each other. Code performance 
results for several examples of scheme II are shown in figures 8-9. Finally, 
in Section V, we summarize our results and draw some conclusions which 
are useful in the design of concatenated coding systems. 

II Coding Scheme I - Without Side Information 

A concatenated coding system is shown in Figure 1. As mentioned in Sec- 
tion I, we use bandwidth efficient trellis codes as inner codes and (N,K) RS 
codes with symbols over GF(2 b ) as outer codes. Decoding is accomplished 
in the reverse order of encoding. The inner decoder uses the Viterbi al- 
gorithm. The outputs of the inner decoder (without side information) axe 
grouped into 6-bit symbols. Because the lengths of the bursts of decoding 
errors made by the Viterbi decoder are widely distributed, symbol dein- 
terleaving is used so that errors in the individual RS-symbols of one block 
(N symbols) are independent; otherwise, a very long block code would be 
required for efficient operation. 

In this concatenated coding system, the bandwidth efficient inner trellis 
codes have two functions: 

1. To compensate for the bandwidth expansion introduced by the outer 
RS code; 

2. To compress the random errors on the inner channel into symbol 
errors which can be corrected by the outer RS code. 

In the reminder of this section, we study the performance of Scheme 
I. We first calculate the decoded BER and determine the coding gain, at 
a given decoded BER, in terms of the “information energy-to-noise power 
density ratio”, Eb/N 0 . Then we derive an expression for the asymptotic 
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coding gain for very large values of E^/Nq. All coding gains are given with 
reference to uncoded QPSK modulation. 


A. Decoded BER 

Let the minimum distance of the (N,K) RS code be d 2 . When an N- 
symbol block is received by the outer decoder, it performs errors-only de- 
coding. That is, if the IV-symbol block contains t 2 = {(d 2 — 1)/2J or fewer 
errors, the errors are corrected; otherwise, decoding fails. t 2 is called the 
error-correcting-capability of the (N,K) RS code [14]. The decoded bit error 
rate at the output of outer decoder is closely approximated by 


Pb^ 


d 2 

2 N 


N 


E 

i=t 2 + 1 


W - P.) 


N-i 


(i) 


where P 3 is the symbol error probability into the outer decoder. In this 
subsection, we give several examples of Scheme I. For each example, we 
use an RS code with symbols over GF(2 S ) and N = 255 as the outer code. 
Rate R\ trellis coded 2 L -ary PSK modulation, with L = 3 or 4, is used as 
inner code. Since 2 L -ary PSK modulation is two dimensional, the effective 
information rate of the inner code is given by 


Rgfj — — Ri bits/dimension, 


( 2 ) 


and the overall effective information rate of the concatenated coding system 
is 

R e ff = Reffjy = R-eff -^2 bits/ dimension, (3) 

where R 2 = K/N is the rate of the (N,K) RS code. 

The symbol error probability P a in (1) is obtained by computer sim- 
ulation on an AWGN channel unless otherwise noted, and (1) is used to 
estimate the final decoded BER. 
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Example 2.1 

Periodically time-varying trellis codes (PTVTC) of rate R x = (2 P + 
i)/3P, P > 2, 1 < i < P, are used as inner codes with 8-PSK modulation 
[6]. These codes are especially designed for high data rate channels with 
low decoding complexity. This PTVTC/8-PSK system has an effective 
information rate = R x Zj2 = (2P + i)/2P > 1 bits/dimension, so it is 
possible to achieve an overall effective information rate P e // equal to 1 bit/ 
dimension or greater. Moreover, because of the periodic property of the 
code, its trellis structure is the same as the trellis structure of a lower-rate 
code, and as a result the complexity of the decoder is reduced significantly 
[ 6 ], 

The PTVTC’s are decoded by the Viterbi algorithm assuming no de- 
modulator output quantization. Fig. 2.1 shows the decoded BER P & vs. 
E(,/N 0 for concatenated codes with 4-state inner codes and 3 different RS 
outer codes, where the outer code rate R 2 is chosen such that R e jj = 1 
bit /dimension. Fig. 2.2 gives similar results for 16-state inner codes. In 
Figs. 2.3 and 2.4 we show the required P&/1V 0 to achieve decoded BER’s 
of 10 -6 and 10~ 9 for 4-state and 16-state inner codes, respectively, as a 
function of R e ff- 

From Figs. 2.1 - 2.4 we observe that, for a given R e ff, systems with 
lower inner code rates give better performance. Also note that at high 
effective code rates, e.g., R e ff = 1 bit /dimension, we still get large coding 
gains. For example, with a 16-state, Ri = 7/9 PTVTC, the coding gain 
equals 3.92 dB at P(, = 10 -6 and 6.18 dB at P& = 10 -9 . Even for a 4-state, 
Pi = 7/9 PTVTC, coding gains of 3.68 dB and 5.98 dB at Pf, = 10 -6 
and Pj = 10 -9 , respectively, can be obtained. Note that as the number of 
trellis states increases from 4 to 16, only about 0.2 dB more coding gain 
is obtained. This appears to be a characteristic of coded modulation in 
general [1]. We will see this more clearly when we derive the asymptotic 
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coding gain. 


Example 2.2 

Ungerboeck’s 16-state, rate R x = 2/3 convolutional code with 8-PSK 
modulation is used as the inner code. This code has an effective information 
rate R^/f = 1 bit/dimension. Therefore, the overall effective information 
rate, R e ff, of the concatenated coding system is less than 1. Again, the 
inner code is decoded by the Viterbi algorithm without demodulator output 
quantization. The final decoded bit error probability P& is shown in Fig. 
3.1 for 2 different RS outer codes. Fig. 3.2 shows the required E^/Nq to 
achieve Pj = 10 -6 and 10~ 9 as a function of P e //- At P* = 10" 6 and 10 -9 , 
coding gains of 4.96 dB and 7.25 dB, respectively, can be obtained with 
only 12.5% bandwidth expansion. This compares with 3.7 dB and 3.9 dB 
coding gains, respectively, with no bandwidth expansion, for the inner code 
alone. 

Example 2.3 

In this example, two kinds of trellis codes are used as inner codes. The 
first kind uses 8-PSK modulation twice per trellis interval (forming a 64- 
ary set), and this set is then coded with a R x = 5/6 trellis code [7,8]. 
Viewed more generally, code symbols are mapped onto a four-dimensional 
signal set (8-PSK x 8-PSK) with 5 information bits for every two 8- 
PSK signals. Thus the effective information rate is R^jj — 5/4 = 1.25 
bits/dimension. The second kind uses rate Pi = 3/4 convolutional codes 
with 16-PSK modulation [9]. The effective information rate is R^/f = 3/2 
= 1.5 bits/dimension. This code is more bandwidth efficient than the Pi 
= 5/6, 8-PSK code, but it is less power efficient. 

Let df, E„ and A be the normalized minimum free Euclidean distance, 
the energy per modulation symbol, and the number of nonzero information 
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bits in the set of paths at distance df from the correct path, respectively. 
Then at high signal-to-noise ratios, the bit error probability at the Viterbi 
decoder output is approximated by 



The RS-symbol (over GF(2 b )) error probability at the output of the Viterbi 
decoder is bounded by 


Ps < be. (5) 

The final decoded bit error probability Pi , can be obtained from (4.1), (4.2), 
(5), and (1). Tables 1.1 and 1.2 list the coding gains over uncoded QPSK 
at P h = lO” 6 and 10" 9 for the R x = 5/6, 8-PSK and R i = 3/4, 16-PSK 
inner codes, respectively, where we have chosen the outer RS code rate R 2 
such that the overall effective information rate R e fj = 1 bit/dimension. 
In both cases, the relatively poor performance of the 8-state and 16-state 
codes compared to the 4-state codes is due to a large number of minimum 
free distance paths. 



Having studied the performance of concatenated coding systems for 
small and medium values of E^/No, it is interesting to look at how the 
systems perform at large values of E^/No, i.e., the asymptotic coding gain. 
This also gives us some insight into concatenated coding system design. 

When uncoded QPSK is used on an AWGN channel with coherent de- 
modulation, the demodulator output BER is given by 

a = Q for large E t /N„. (6) 
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For our concatenated coding system, we assume rate Ri trellis coded 2 L - 
ary PSK modulation as the inner code and an (N,K) RS code with symbols 
over GF(2 b ) as the outer code. Then 


E a — R 2 R\LE b , 


and from (2) and (3) it follows that 

E, = 2R 2 R%E b = 2 R eff E b . (7) 


The BER at the output of a Viterbi decoder without demodulator output 
quantization is approximated by 


e ~ AQ 


“ 2 e 



d)E s 


2No 

A f R e}f E b 


= AQ 


<PfR e jfE b 

M No 


2 N 0 


( 8 ) 


From (8), (5), and (1), the final decoded BER P b of the concatenated coding 
system is approximated by 




Pb ~ 2N^ + 1 ) P s 2+1 ( 1 - P ^ 

d, / N \ f Ab\ ti+l 
“ + e ~ • 


( 9 ) 


Comparing (6) and (9), we see that for a fixed E b /No, the (negative) expo- 
nent with concatenated coding is larger by a factor of dyi? e //(t 2 + l)/2 than 
the exponent without coding. Since the exponential term dominates the er- 
ror probability expression for large E b / No, we define the asymptotic coding gain 
(in decibels) as 


7 = 101og 10 


dfReffjij + 1 ) 
2 


( 10 ) 
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For RS codes, 


tl 


N-K N( 1 - R 2 ) 


( 11 ) 


2 2 

Substituting (11) into (10), the asymptotic coding gain can be rewritten as 


7 = 101o glo 2^pi^ + 1 ) 


= 10 log 10 + 10 log 10 R, ( W(1 - fli) + l) . 


( 12 ) 


Note that in (12) the first term denotes the coding gain due to the band- 
width efficient inner code and the second term represents the coding gain 
contributed by the outer code. This implies that, for very large Eb/N 0 , the 
inner and the outer codes in a concatenated coding system can be designed 
independently of each other. 

Taking derivatives of 7 with respect to R 2 , the maximum coding gain 
is achieved at R 2 = 1/2 + 1 /N for a fixed inner code, and 


^Imax 


= 10 log.0^1 + 1010^1(^ + 1 + 1). (13) 


Figure 4 shows the asymptotic coding gain 7 for Example 2.1. Figure 5 
shows 7 for Example 2.2, Figure 6.1 shows 7 for Example 2.3 when Ri = 5/6 
coded 8-PSK is used as the inner code, and Figure 6.2 shows 7 for Example 
2.3 when -Ri = 3/4 coded 16-PSK is used as the inner code. The asymptotic 
coding gains shown in these figures axe very large, even at high effective 
information rates. For example, in Figure 4 at R t jj = 1 bit /dimension, the 
asymptotic coding gain can be as large as 17 dB. 

Before finishing this section, we can draw several conclusions from the 
above discussion. 


1. For a given R e jf, there exists an R^jj which optimizes system perfor- 
mance, or conversely, for a given R^ff, there exists an optimum value 
of R e ff. 
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2. System performance is very sensitive to fZ 2 at very low and very high 
values of R 2 . 

3. For small or moderate values of E\,/Nq, and for a fixed R e fj, a lower 
R^eff is preferred as long as the outer code rate R 2 is not too high. 

4. Because df increases slowly as the number of trellis states increases, 
choosing inner codes with a small number of trellis states increases 
the data transmission rate (by reducing the number of decoder com- 
putations) with only a slight sacrifice in system performance. 

Ill A Modified Viterbi Algorithm 

It has been shown that the channel capacity of a concatenated coding sys- 
tem with side information is an upper bound on the capacity without side 
information [13]. The difference between the two capacities is significant, 
especially on very noise channels. Therefore, it is advantageous if the inner 
decoder can provide some kind of reliability information (side information) 
about its estimated output which will aid the outer decoder. In scheme I, 
discussed in Section II, the inner code is decoded by the Viterbi algorithm 
and the output of the decoder is a sequence of “hard decisioned” binary 
digits, i.e., no side information is associated with the output. To provide 
side information, it becomes necessary to modify the conventional Viterbi 
algorithm so that soft decisioned outputs axe made available to the outer 
decoder. 

Several ideas have been proposed to provide some kind of side infor- 
mation with the convolutional decoder outputs. Zeoli [16] proposed a con- 
catenated coding system that employs a long constraint length (m = 31) 
convolutional code obtained by annexing a tail to a (3,1,7) convolutional 
code. The longer code is then decoded by the same Viterbi decoder as 
the short code, with the exception that the information sequence along the 
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best path to each state is treated as correct and used to “cancel” the effect 
of the longer tail from the encoded sequence. The tail provides excellent 
error-detection capability once the decoder starts to make mistakes. But 
the algorithm is subject to very serious error propagation and the decoder 
has to be reset frequently. 

Another method of providing reliability information with the decoder 
output is to compute the a posteriori probability of each decoded symbol 
from the decoder being correct [12]. However, the a posteriori probabil- 
ity must be computed for each branch using a recursive method and real 
numbers must also be stored. This computation process slows down the 
decoder, so it is not suitable for high speed systems. 

A third alternative is to use the “Viterbi decoding algorithm for con- 
volutional codes with repeat request” [17] to extract reliability information 
from the inner decoder. When all the path metrics at some level of the 
trellis are below a predetermined threshold, the received sequence up to 
that level is erased. But this approach has two major drawbacks: 1) when 
a long received sequence is erased, this erasure information cannot be used 
by the outer decoder because the number of erasures may exceed the era- 
sure correction capability of the outer code; 2) with high probability, most 
of the symbols in the received sequence can be decoded correctly and hence 
should not be erased. 

In the following, we propose a decoding technique based on the Viterbi 
algorithm which erases only the information symbols that are “probably 
in error” . This erasure procedure does not affect the decoder’s selection of 
the most likely path. Therefore, the decoder is still maximum-likelihood. 
Furthermore, the algorithm is very simple to implement and capable of high 
speed operation. 

As the name suggests, an (n,k,m) trellis code is best described in terms 
of its trellis diagram. In the decoding of trellis codes by the Viterbi al- 
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gorithm, a first-event error is made at an arbitrary level j if the correct 
path is eliminated for the first time at level j in favor of the incorrect path. 
This is illustrated in Figure 7. The incorrect path must be a path that 
had previously diverged from the correct path. For the example shown in 
Figure 7, let 


and 


Vj = (• • 

• , ^-3, 

Vj-2 

Vj- 1, 

Vi) 

V j = (• • 

■ 1 v;_ 3 , 


V- 

v :-v 

V/) 


represent the correct path and the incorrect path, respectively, where VJ(Vj') 
is the symbol on the i— branch of the correct (incorrect) path. Also let 


Uj = (• • • , Uj-2, Uj. 2 , Uj-H Uj) 


and 


U'_„ u;. v Dp 

represent the information sequences associated with paths V,- and V', re- 
spectively, where Ui(U-) is a binary k-tuple. Denote a branch metric by 
A(K) and a path matric by A(V ; ). From Figure 7 we observe that the 


bits Uj_ 3 , 


U'- 2 , 


V-i, 


Uj are unreliable and should be erased. This 


observation motivates us to propose the following decoding algorithm. 


Modified Viterbi Algorithm: 

For each state at level j, select the path V' =(•••, VJ_ l+2 , • • • , 

Vj_ j, Vj) {l > 1) that has the largest metric A(V') and the path Vj = 
(• • • , Pj_/ + 1 , Vj_z + 2 , • • • , V)-!, Vj) that has the next largest metric A(Vj). 
Because V) has the largest metric, V'- survives at that state. Moreover, if 

A(V') > A(vp + r (T > 0), ( 14 ) 
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decoding continues as in the conventional Viterbi algorithm. On the other 
hand, if 

A(V') < X(\j) + T, (15) 

then the bits Uj-v Uj on *be surviving path are 

erased and decoding continues. 

This algorithm applied to any (n,k,m) trellis code. If, however, all k 
input bits axe shifted into memory, there can be no parallel transitions in 
the trellis. Since the last km information bits associated with an error event 
of length / branches must be correct for the incorrect path to remerge with 
the correct path, decoding errors are confined to the first 1 — m branches. 
In this case, when (15) is satisfied, only the bits TJ' y -(+i’ k r j_ ;+2 , •• • , Uj_ m 

on the surviving path are erased. 

Note that if T = 0, the algorithm reduces to the conventional Viterbi 
algorithm. As an illustration of the algorithm, in Figure 7, for some state 
at level j, suppose that A(V') > A(V J ), so the incorrect path V' survives. 
If (15) is satisfied, then the bits Uj_ 3 , Uj_ 2 , Uj_ x , Uj are erased. Note 
that (15) is equivalent to 


KV^+KVJ.^+XiVj^+XiVJ) < A(V i _ 3 ) + A(V j _ 2 )+A(V J _ 1 )+A(V i )+T. 

(16) 

This implies that, if the metrics of the two 4-branch sequences axe too 
“close” in terms of the threshold T, the corresponding information sequence 
on the surviving path is not reliable, and therefore a tag should be attached 
to it which indicates its degree of reliability. The simplest way of doing this 
is to erase it. 

From (16) we see that this erasure decision is made on the most recent l 
branches for an /-branch error event, and hence it is a “local” estimate. On 
the other hand, the path estimate is a “global” estimate. From this point 
of view, we see that the algorithm is constructed based on the following 
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ideas: 

1. By letting the path with the largest path metric survive, we axe choos- 
ing the maximum-likelihood estimate. 

2. By performing a “branch comparison” over the most recent l branches, 
we provide some reliability information on the maximum-likelihood 
path. 

Because the reliability is estimated over only l branches, the reliability 
estimates are suboptimum. However, in feedback decoding of convolutional 
codes, decisions are made with a delay of only one constraint length [14]. 
Hence, we can say that the reliability estimates have a certain degree of 
precision. Ideally, the number of erased k-tuples, /, should equal the length 
of the error event. Practically, however, choosing l equal to the average 
error event length should be sufficient. 

IV Coding Scheme II - With Side Information 

The encoding process is the same as in Scheme I. Decoding is done in 
two steps. First, the inner trellis code is decoded by the modified Viterbi 
algorithm presented in the last section. The outputs of the inner decoder 
consist of binary digits as well as erased bits, and they axe grouped into 
6-bit symbols, deinterleaved, and sent to the outer decoder. Symbols which 
contain any erased bits axe considered as erasures by the outer decoder. 

Let i be the number of erased symbols in an RS outer codeword of length 
N. The outer decoder declares an erasure (or raises a flag) for the entire 
block of N symbols if i is greater than the erasure-correction threshold T es , 
where 

T es < ( d 2 - 1) (17) 

and d 2 is the minimum distance of the outer (N,K) RS code. If i is less than 
T es , the outer decoder starts errors-and-erasures decoding on the iV-symbol 
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block. Let t(i) be the error-correction threshold for a given i, where 


t(i)=[(d 2 -i-l)/2\. (18) 

If the syndrome of the JV-symbol block corresponds to an error pattern of 
i erasures and t(i) or fewer symbol errors, errors-and-erasures decoding is 
successful. The values of the erased symbols, and the values and the loca- 
tions of the symbol errors, are determined based on the decoding algorithm. 
However, if more than t(i) symbol errors are detected, then the outer de- 
coder again declares an erasure (or raises a flag) for the entire iV-symbol 
block. 

Let P, and P e be the RS-symbol error probability and RS-symbol era- 
sure probability at the input of the outer decoder, respectively. The prob- 
ability of correct decoding of a block is given by 
Xs» /N\ ( N— A 

p c = E (>) p;Z [ i ) p;( i -p.- p.f-^, (i9) 

where T ea and t(i) are defined in (17) and (18). Let Pj, e and Pm denote the 
probabilities of block erasure and incorrect decoding, respectively. Then 

Pc + Pbe + Pbi = 1? (20) 

and 


Pbe + Pbi = 

+ 

The probability of incorrect decoding of a block is not easy to determine, 
but it can be upper bounded by [18] 

Te* /jyx N-i / JV— A 

Pu<E(i)Pi £ j P/(l - P. - P') N -‘-\ (22) 

x*=0 V J \ / 


1 -Pc 

Tes /m N—i / 

E(s)p: e ; p/(i - p e - 

«'= 0 v 7 j=t(i)+l \ / 

E (?) P'X 1 - p*)"-*- (21) 

i=T e ,+ 1 v 7 
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and the final decoded BER of the concatenated coding system is approxi- 
mated by 



(23) 


In the following we consider several examples for Scheme II. The outer 
code is again an RS code of length N = 255. Bandwidth efficient inner 
trellis codes axe decoded by the modified Viterbi algorithm. We use B to 
denote the number of erased branches in the modified Viterbi algorithm, 
and the erasure threshold is chosen to be 


T = ( T'd/E b /N 0 ) 2 , 


(24) 


where d / is the minimum free Euclidean distance of the inner code normal- 
ized by the signal energy at the demodulator output, and T' is a normalized 
threshold. The RS-symbol error probability P s and the RS-symbol erasure 
probability P e at the input to the outer decoder axe determined by com- 
puter simulation. In decoding the outer code, fixed erasure-correction and 
error-correction thresholds T ea < (d 2 — 1) and t < [(d 2 — 1 ) /2 J , which are 
independent of the number of erased symbols in an iV-symbol block, axe 
assumed, and (21) and (22) are modified as 

/ j\r\ N—i / jy — A 

Poe + PU = £ (?) Pi E i I Pi(l -Pe- P.)"-*-* 

*=0 j=mtn(t(f),t)+l > / 

+ E -(fjP’a-Pe)"-*', ( 25 ) 

»=T e .+l V ' 


and 


Pbi< 



N-i 


E 

j = <^2 — *— mm(t(i),t) 


P/(l - P, - P e ) N ~^. (26) 


Usually, the probability of block erasure P{, e is much larger than the prob- 
ability of block decoding error Pf,,. Therefore, Pj e + P&,- is a tight bound on 
Pi, e . In the examples below we compute the sum of the probability of block 
erasure and block decoding error, P& e + Pbi, and the final decoded BER P 5 . 
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Example 4.1: 

As in Example 2.1, the PTVTC’s of Ri = (2 P + i)/3P, P > 2, and 

I < i < P with 8-PSK modulation are used as inner codes. Only 16-state 
codes axe considered. P& and P& e + Pm, with Ri = 7/9, 5/6, 8/9, axe shown 
as functions of t in Figures 8.1, 8.2, and 8.3, with T ea = 10, 8, and 9, 
respectively. 

Example 4.2: 

As in Example 2.2, Ungerboeck’s Pj = 2/3 coded 8-PSK with 16 trellis 
states is used as the inner code. P& and Pf, e + Pm axe shown in Figure 9 as 
a function of t with T ea = 10. 

From the above examples, we see that for a given E^/Nq, the decoded 
BER of Scheme II is significantly lower than that of Scheme I. Scheme 

II provides us with flexibility in the system design. Tradeoffs between 
the decoded BER and the probability of block erasure can be achieved by 
changing T', B,T ea , and t. 

V Summary and Conclusions 

In this paper we considered high rate concatenated coding systems with 
bandwidth efficient inner codes and RS outer codes for application to satel- 
lite communication systems. Specifically, we considered trellis inner codes 
with MPSK modulation which achieve effective information rates equal to 
or greater than 1 bit/dimension. These codes mitigate the non-linear ef- 
fects of TWT amplifies and make use of soft decision Viterbi decoding to 
achieve significant coding gains on bandlimited channels. 

Two types of concatenated coding systems were studied. Scheme I oper- 
ates without side information, while Scheme II uses side information. The 
performance of Scheme I was studied by computer simulations, formula cal- 
culations, and by asymptotic coding gain derivations. Our results indicate 
that coding gains from 4 to 8 dB can be achieved at decoded BER’s of 10~ 6 
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to 10 -9 with little or no bandwidth expansion. For example, with 16-state 
inner codes with 8-PSK modulation, coding gains of 6.18 dB at 10 -9 and 
3.92 dB at 10 -6 without bandwidth expansion (Example 2.1, Figure 2.2) 
can be achieved. With only 12.5 % bandwidth expansion, coding gains 
of 7.25 dB at 10 -9 and 4.96 dB at 10 -6 (Example 2.2, Figure 3.1), are 
achievable. These 16-state decoders are capable of high speed operation. 

In studying the performance of Scheme II, we first proposed a modified 
Viterbi algorithm for decoding trellis codes. Side information was provided 
to the outer decoder in the form of symbol erasures. The outer RS code was 
then decoded by an errors-and-erasures decoder. A significant improvement 
in decoded BER was obtained for Scheme II. For example, decoded BER’s 
as low as 10 -16 , with an erasure rate no more than 10 -3 , can be achieved 
with a 16-state inner trellis code, 8-PSK modulation, and an overall effective 
information rate of 0.9 bits/dimension at an E^/Nq of 5.45 dB (Example 
4.2, Figure 9). For systems where block erasures are allowed, such as ARQ 
systems, Scheme II is highly recommended. 

References 

[1] G. Ungerboeck, “Channel Coding with Multilevel/Phase Signals,” 
IEEE Trans, on Inform. Theory , IT-28, pp. 55-67, January 1982. 

[2] G. D. Forney, Jr., et. al., “Efficient Modulation for Band-Limited 
Channels,” IEEE J. Select. Areas Commun., SAC-2, pp. 632-647, 
September 1984. 

[3] A. R. Calderbank and N.J.A. Sloane, “New Trellis Codes,” IEEE 
Trans. Inform. Theory , to appear. 

[4] L. F. Wei, “Trellis- Coded Modulation with Multi-Dimensional Con- 
stellations,” IEEE Trans. Inform. Theory , to appear. 


20 


[5] G. D. Forney, Jr., “Coset Codes,” IEEE Trans. Inform. Theory , to 
appear. 

[6] F. Hemmati and R.J.F. Fang, “Low Complexity Coding Methods 
for High Data Rate Channels,” Comsat Laboratories Technical Note, 
February 1984. 

[7] A. LaFanchere, R. H. Deng, and D. J. Costello, Jr., “Trellis Coded 
Phase Modulation, Using Unit-Memory and Partial-Unit-Memory 
Convolutional Codes,” submitted to IEEE Trans. Inform. Theory , 
January 1987. 

[8] S. G. Wilson, “Rate 5/6 Trellis-Coded 8-PSK,” IEEE Trans, on Com- 
mun., COM-34, pp. 1045-1049, October 1986. 

[9] S. G. Wilson, H. A. Sleeper, II, P. J. Schottler, and M. T. Lyons, “Rate 
3/4 Convolutional Coding of 16-PSK: Code Design and Performance 
Study,” IEEE Trans, on Commun., COM-32, pp. 1308-1315, December 
1984. 

[10] G. D. Forney, Jr., Concatenated Codes, Cambridge, MA: MIT Press, 
1966. 

[11] J. P. Odenwalder, “Optimal Decoding of Convolutional Codes,” Ph.D. 
Thesis, University of California, Los Angeles, 1970. 

[12] L. N. Lee, “Concatenated Coding Systems Employing a Unit-Memory 
Convolutional Code and a Byte-Oriented Decoding Algorithm,” IEEE 
Trans. Commun., COM-25, pp. 1064-1074, October 1977. 

[13] M. A. Herro, D. J. Costello, Jr., and L. Hu, “Capacity and Cutoff 
Rate Calculations for a Concatenated Coding System,” IEEE Trans. 
Inform. Theory, to appear. 


21 


[14] S. Lin and D. J. Costello, Jr., Error Control Coding: Fundamentals 
and Applications , Englewood Cliffs, NJ: Prentice-Hall, 1983. 

[15] T. Fujino, et.al., “A 120 M bits/s Coded 8-PSK Modem with Soft- 
Decision Viterbi Decoder,” IEEE Int. Conf. on Communications Con- 
ference Record , June 1986, Toronto, Canada. 

[16] G. W. Zeoli, “Coupled Decoding of Block- Convolutional Concatenated 
Codes,” IEEE Trans, on Commun., COM-21, pp. 219-226, March 
1973. 

[17] H. Yamamoto and K. Itoh, “Viterbi Decoding Algorithm for Convo- 
lutional Codes with Repeat Request,” IEEE Trans. Inform. Theory , 
IT-26, pp. 540-546, September 1980. 

[18] T. Kasami, S. Lin, and T. Taka t a, “A Cacaded Coding Scheme for 
Error Control,” IEEE Trans. Inform. Theory , to appear. 


22 



Table 1. Coding gain over QPSK of Example 2.3 
with R^ = 5/6 coded 16-PSK inner code (formula calculation) 


# o f inner 
code states 

Rgff, bits per 
dimension 

Coding gain at 10“^ 
(E b /N 0 )dB 

Coding gain at 10“9 
(Eb/No)dB 

2 

1 

2.85 

5.08 

4 

1 

5.07 

7.13 

8 

1 

3.65 

6.28 

16 

1 

4.15 

6.35 


Table 2. Coding gain over QPSK of Example 2.3 
with R^ * 3/4 coded 16-PSK inner code (formula calculation) 


# of inner 
code states 

Reff, bits per 
dimension 

Coding gain at 10“^ 
(E b /N 0 )dB 

Coding gain at 10' 
(E b /N 0 )dB 

2 

1 

2.05 

4.18 

4 

1 

3.60 

5.77 

8 

1 

3.22 

5.45 


16 


1 


3.22 


5.47 


















Fig. 2.4 E^/Nq required to achieve = 10 and 10 vs. R e £f 

for Example 2.1 with 16-state PTVTC inner codes (simulation) 
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Fig. 8.1 Performance of Example 4.1 (simulation) with 

16-state R^=7/9 PTVTC 8-PSK inner code and B=7, 

T ,= 0. 2, T es =10, d 2 =37, R eff »l. (1) E b /N Q =6.1 dB 

(P b =2.5xl0 -6 in scheme 1); (2) E b /N Q =6.34 dB (P b = 
8.5x10 ® in scheme 1). 
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Performance of Example 4.2 (simulation) with 
16-state R^=2/3 coded 8-PSK inner code and B=4, 

T'=0.3, T es =10, d 2 *26, R eff =0.9. (D ^^o” 5,07 dB 

(P =1.11x10 ^ in scheme 1); (2) E,/N n =5.45 dB 

b -10 D 
(P =5.3x10 in scheme 1). 
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ABSTRACT 

Bandwidth efficient codes can be used to improve the performance of digital transmis- 
sion on bandwidth limited channels such as the telephone channel. An important class of error 
correcting codes suitable for bandwidth limited channels is referred to as Trellis Coded Modu- 
lation (TCM). Such codes combine a binary convolutional encoder and a modulation scheme. 
Recently, many good TCM schemes have been found. In this paper, we derive a lower bound 
on the minimum free Euclidean distance of TCM which guarantees die existence of good 
TCM codes of any complexity. We use the bound to compare trellis codes combined with 
PSK, PAM, or QASK modulation. We also compare the bound with known upper bounds. 
This random coding bound is exponentially tight for large constraint lengths and predicts the 
asymptotic performance of TCM when the complexity of the code becomes large. The bound 
can be used with any code rates and any modulation scheme and shows that the free distance 
increases linearly with the constraint length for large values of the constraint length. 


I INTRODUCTION 

The transmitter power and the channel bandwidth constrain the design of a digital com- 
munication system. On some channels the bandwidth constraint prevails over the power con- 
straint For example, on the telephone channel (a typical bandwidth limited channel with 
bandwidth W = 3 kHz), a large Signal-to-Noise Ratio (SNR) of around 28 dB permits the use 
of several signal amplitudes with a moderate error probability [Forney et al., 1984]. On such 
channels, efficient coding systems transmit at more than one information bit per signal dimen- 
sion. On satellite channels, the power constraint used to prevail over the bandwidth constraint. 
But the increasing need for bandwidth efficient satellite communication systems makes the 
satellite channel more bandwidth limited. Bandwidth efficient codes increase the power 
efficiency on bandwidth limited channels, i.e., for a given bandwidth and rate of information 
bits sent per transmission interval, a desired error probability can be achieved with less average 
signal energy. If Additive White Gaussian Noise (AWGN) perturbs the channel, the error pro- 
bability of an uncoded system depends on the minimum Euclidean distance between signals. 
We will define the minimum Euclidean distance more precisely later. Distributing a given 
average energy into more signals without coding reduces the distances between signals and 
increases the probability of error. A bandwidth efficient coding scheme lowers this increased 
error probability to that of an uncoded scheme of larger average energy by introducing 
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redundancy between successive symbols. In other words, a bandwidth efficient scheme 
transmits with the same rate and error probability as an uncoded system with larger average 
energy. Alternately, a bandwidth efficient scheme can transmit more information bits per 
transmission interval than an uncoded scheme with the same average energy and error proba- 
bility. However, the tradeoff is increased complexity. TCM is an application of Shannon’s 
information theory, and cutoff rate and charnel capacity calculations show that 6 dB to 9 dB 
can be gained over an uncoded system at an error probability of 10 -5 to 10 -6 [Ungerboeck, 
1982]. 

H DEFINITIONS 

A typical trellis coded modulation (TCM) scheme includes a binary encoder followed 
by a modulator (Fig.l). During every signaling interval [rr,(r+l)7] of length T, k input bits 
u[®, i = 1 , . . . , k, enter the encoder, and n output bits vj\ j = \ , n, leave the 

encoder, k is the information stream width. For any t, there are 2 k possible input blocks 
( 4\ . . . , u$ ) and 2" possible output blocks ( v^, .... ). R b ^ kin is the 

rate of the binary encoder. The memory order of the encoder is V, and every output 
depends on the current information block ( k bits) and on the v previous blocks (yk bits). 
However vj\ j = l n, may not depend on all the vk past information bits, and the 

number of memory registers v,- that store one input stream may depend on i, so that 

k 

0 £ V; < v. We call v 0 = £ V; the total memory of the TCM scheme. In this paper, we 

i = l 

restrict our study to the case where V; = v, i = 1, . . . , k, so that the constraint length is 
Vq = kv, i.e., we assume that the encoder contains k memory registers of equal length v. The 
encoder of a TCM scheme is a finite state machine with 2 V ° states, where the contents of the 
v 0 binary registers specifies the encoder state. 2 k branches corresponding to all possible input 
blocks enter and leave each state. The modulator maps each of the 2" binary n-tuples leaving 
the binary encoder onto a signal constellation. A signal constellation 5 is a finite set of M sig- 
nal symbols and can be generated from a set of D orthogonal signals (sig- 

nal space representation). In our case M = 2 n , and the signal constellation has D dimensions. 
We define the rate of a TCM scheme as R = klD, the number of information bits transmitted 
per dimension. The codewords in a TCM scheme can be represented by a trellis whose 
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branches are labeled with channel signals, and TCM schemes are sometimes called trellis 
codes. A (k, v) trellis code is defined by a trellis with 2^ states and 2 k branches leaving each 
state and entering 2 k different states, and a time-varying mapping of signals in S onto 
branches. The complexity of the decoder is a function of 2^ v ° + ® = 2 k + (Note that cer- 
tain authors define the constraint length as n(v + 1) [Lin-Costello, 1983].) 

Binary and quaternary phase shift keying (BPSK, QPSK) send one information bit per 
dimension (R = 1). Coding on bandwidth efficient systems requires signal constellations with 
more than 2 signals per dimension, for example 8-PSK, to send more than one information bit 
per dimension (R > 1). These higher alphabet modulation schemes can be used to generate 
redundancy in the transmitted message to improve the reliability of forward error correction 
(FEC) coding. In conventional FEC coding systems, the encoder is designed to maximize the 
minimum Hamming distance between binary codewords (n-tuples v^). Then these binary 
codewords are mapped onto a signal constellation disregarding the geometry of the constella- 
tion. Nonetheless, for AWGN channels, the channel error probability between channel signals 
y t and y{ in S depends on the Euclidean distance d e (y t , y t ') between signals y t and y’ t in the 
signal constellation 5. The Euclidean distance can be made proportional to the Hamming dis- 
tance only for BPSK and QPSK modulation. This is why for TCM schemes that use signal 
constellations other than BPSK or QPSK, the encoder and the modulator are selected joindy to 
maximize the minimum Euclidean distance between codewords rather than the minimum Ham- 
ming distance. 

The signal error probability P s of a trellis code used on an AWGN channel and 
decoded with a maximum-likelihood sequence estimator (Viterbi algorithm) depends on the 
Euclidean distance between channel signals [Viterbi, 1971]. P s is upperbounded by: 

P s <4 Z me**'* , (1) 

d = dfr„ 

where the stun is taken over all possible distances d between distinct paths. dp. ee is the 
minimum Euclidean distance between distinct paths, A(d) is the path multiplicity of distance d, 
i.e., the number of error paths at distance d from the correct path, E s is the average signal 
energy of the constellation, Nq is the one-sided power spectral density of the Gaussian noise, 
and EJNq is the SNR of the channel. Equation (1) means that the performance of a TCM 
scheme depends on the distribution of distances between encoder output sequences correspond- 
ing to distinct encoder input sequences. For large values of SNR, the sum of (1) is equivalent 
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to 1/2 A(dp ee ) e ifr “ E,lNo , whose behavior is mostly determined by the exponent, or simply 
by dj ree . Therefore, for large SNR, a small error probability P s corresponds to a large dfree- 
We call d min the minimum Euclidean distance between signals in a constellation. Then the 
asymptotic coding gain (for large SNR) of a TCM scheme over an uncoded system is given by 

G £ 10 log io - 7 —, where cL.^ is the minimum Euclidean distance of the TCM scheme, and 
^min 

d m in is the minimtnn Euclidean distance of the uncoded scheme. 

In this paper we derive a lower bound on the minimum Euclidean distance of TCM 
schemes. The bound is similar to the bound of Costello [Costello, 1974] and Forney [Forney, 
1974] on the free distance of convolutional codes. The bound guarantees the existence of codes 
that perform better than the average. We use this bound to compare different modulation and 
coding schemes, such as those proposed by Ungerboeck [Ungerboeck, 1982], Calderbank and 
Sloane [Calderbank-Sloane, 1985], and Lafanechere, Deng, and Costello [Lafanechere-Deng- 
Costello, 1987]. We also compare it with Calderbank, Mazo, Shapiro, and Wei’s upper bound 
[Calderbank-Mazo-Shapiro, 1983 and Calderbank-Mazo-Wei, 1985]. The main advantage of 
using a bound to compare schemes over searching for good codes is that we do not use 
exhaustive methods but rather optimize the bound and its parameters. This reduces the com- 
plexity of finding good TCM schemes, although we cannot guarantee that we find the optimum 
schemes, because there may exist codes that perform better than the bound. However the best 
known codes behave in accordance with the bound. The bound can also be used to predict the 
performance of new schemes and, if promising, then actual codes can be constructed. 


An efficient TCM scheme assigns channel signals to branches to achieve maximum 
dfr ee and minimum error probability P s when using maximum likelihood decoding. The lower 
bound on dp. ee uses a random coding argument based on the moment generating function 


e a ? dl £ 

1 


z 

i 


, ~ a4(y,y) 


where the overbar indicates an average over the ensemble of all 


code choices (for a given trellis), the first sum is over all choices of paths y_, the second sum is 
over all paths diverging from £ at time 1 , a and p are arbitrary constants, d is the lower 
bound on d^ ee> and d e iyj£) is the Euclidean distance between the channel paths y_ and }>'. The 
lower bound depends on V, k, and the distance distribution of the signal constellation. 
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m DERIVATION OF THE BOUND 

For simplicity, we restrict our notation to TCM schemes. However, the derivation, 
except for configuration counting, applies to any Coded Modulation scheme. We call a topo- 
logical trellis a trellis with no signal assignments on the branches, i.e., a trellis with no labels. 
A topological path Y through a topological trellis is a sequence of consecutive branches with 
no labels. A channel path £ is a path through a trellis labeled with channel signals on the 
branches. To a topological path Y corresponds an infinite set of possible channel paths y, but to 
each channel path y corresponds one and only one topological path Y. A channel path y is 
defined by a topological path Y and a sequence of labels: 
y =(•••> Yj> 2/+i» • • • , yj+p ■ . . ), where yj +t is a branch in Y labeled with a channel sig- 
nal in S and j+t is a time index, so that y: +t is the output signal during time interval j+t. We 
will sometimes call y t a signal although it is a labeled branch. The context should make clear 
when we mean a labeled branch or a signal point. A code c is the set of all labeled trellis 
branches for all t. If the code is time-varying, labels depends on t. A code c generates a 
unique set of infinite channel paths y, each path being a set of consecutive branches in c. To 
construct a code, one only needs to label trellis branches with channel signals. The labeling is 
the same for every signal interval if the code is time-invariant. We consider the set C of time- 
varying trellis codes of a given input width k and a given memory length 
Vj = ... = = v. The input width k and the memory length V fully determine the trellis 

topology, i.e., the number of states, the number of branches to and from each state, and the 
states linked by a branch. Note that 2 k branches leave each state and enter 2 k different states. 

We now proceed to derive the bound. The proof of the bound depends on first stating 
precisely what must be proved. 

Statement 1: 

Given a set C of codes c, there exists a code c 0 6 C 
ity, over all codes c in C, that d^. ee {c) < d is less than 

We can rewrite the previous statement as: 

Prob ( d free (c ) <d)<l => Bc 0 e C s.t. d free (c Q ) > d. (2) 


for which dfi. ee (c Q ) > d if the probabil- 
1 . 
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By definition (U ee (c ) = min <i e (22O , where (2,2') varies over all pairs of distinct paths 2 

(MO ^ J 

and ^ in c diverging from the same state at time instant 1. Such a pair (2,20 of paths is usu- 
ally called an error event in the convolutional code literature. We call e ^ (c) the set of paths 
2' e c such that (2,20 is an error event starting at time 1, i.e., 2 and 2* are merged at time 1 
and may remerge later or may never remerge. The notation (c) stands for "the set of error 
paths in c given a correct path 2"- The free distance of a code c is: 




min 

hi') 



<2 € c 

t 6 (c) 


(3) 


Thus, if for a code c and a positive real number d, dj ree {c) < d, at least two distinct paths 
2o € c and 2o e e io exist such that d e (2o,2o) < d. Thus for any positive real number 
a, 

e od\-od\^ 9.Z0O > L (4) 


We upperbound the left side of (4) by summing over all (c): 


V a > 0, X 


ad ‘ ( l0’l r ) 


* 1, 


/ e ^ (c) 


V a, p > 0, e“P d2 


I « 

z' e (c) 




> 1. 


(5) 

( 6 ) 


The previous equation holds for a given correct path 2 q. We now upperbound the left side of 
(6) by summing over all possible correct paths. 2 in c: 


V a, p > 0, e a ^ X 

X € C 


X * 

ly's Cj(c) 




> 1. 


(7) 


We define a function T a> p(c) of a code c by: 


. T«.p(.0 4 1 

J€C 

where cx and p are two positive parameters. 
V c e C and V a, p, d > 0 , 


- ctdXjtf 


hfe Cj (c) 


Then (7) and (8) imply that 


e “P* T ap (c) > J 1 lf dfree(C) ~ d 
[0 if d free (c) > d 


( 8 ) 


(9) 
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Let us define an indicator function on the set C as follows: 


* d> 0, X d (c) 4 J 


1 iff d free (c) < d 
0 iff d fr Jc)>d 


Then for any positive real number d. 


Prob (dp. ee (c) <d)= £ p(c) X d (c) , 

C € C 1 - 


c e C 


( 10 ) 


(ii) 


where p(c) is the probability of the code c among the set C of time- varying codes associated 
with a given trellis and a given signal constellation. Clearly, from (9)and (10), 

V c € C and V «, p, d > 0 , X d (c) < e T a>p (c). (12) 


If (12) were an equality, then (11) and (12) would give an exact formulation of 

Prob (dfr ee (c) < d). So the tightness of (12) will determine the tightness of the bound. Com- 
c e C J 

bining (11) and (12) yields 

Prob (d free (c) < d) < X P( c ) T a,p( c )> (13) 

ceC c6 C 

where a and p are positive parameters that will be used later to optimize the bound. We now 
give a weaker but more explicit version of statement 1. 


Statement 2: 

For any set C of codes c, 

e apdl X P( c ) T a,p( c ) < 1 => 3c 0 e C s.t. d free (c 0 ) > d. (14) 

C € C 

In the following, we upper bound the left side of (14) to obtain an even weaker, but 
very explicit statement. The derivation of a random coding bound is based on switching the 
order of summations [Shannon, 1962], The derivation starts with an average over all codes and 
switches summations to obtain an average over all the output sequences or channel paths. We 
explain Shannon’s concept with a formal example. Let c be a code in C, ^ a channel path of 
the code c, and f(c,y) a function of the code c and of one of its channel paths y_ (^ is often 
called the correct path). According to our notation, c e C and ^ e c. Then switching the 
summations gives: 
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2 2 K‘,D = 2 2 /fez) . «5) 

ceC £€ c 2 e kJ c ceC ll 

ee C 

where y_ e c denotes the set of all channel paths belonging to at least one code c in C and 

C 6 C 

c e C | 2 denotes the set of all codes in C that contain the channel path y. To simplify 

further, we will replace the sum over j|S y c with a sum over z (not to be confused with 

c e C 

the more restrictive sum over £ € c) and the sum over c e C | z by a sum over c | z (not to 
be confused with a sum over c e C). 

Let us now introduce a function 

GQ,c)£ X (16) 

t e (c) 

G(y_,c) is the distance generating function of the code C given that the correct path is y_. For 
binary linear codes and regular codes [Calderbank-Sloane, 1985], G(j,c) does not depend on 
the correct path z From the definition of G(y,c), 

X P( c ) T’a.pfc) = X P(c) X G&c)P • (17) 

ceC ceC yec 

Applying Shannon’s summation switching, 


x p(c) r^pCc) = X E G &’ c ) p - 

c g C E c I Z 

Since, for all codes c | z , p(c) = p(c,y) = p(c | v)p(v), 


E E P( c ) G (zc) p = X IP( c li) G(&cf. 

i c ly i c lz 


If we use the moment inequality 


E Pi a i 


< 


nP 




for p < 1, then 


Z P( c I 2) G(%,c) P < 

c I l 


E p(c I z) G (z> c ) 
c 1 1 


(18) 


(19) 


( 20 ) 
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We now switch the order of summations over the codes c \ y and the paths y: 

2 P(c I 2 ) Gfcc) = XMdl) 2 

c I 1 C I £ 6 Cj (c) 

= 2 2 Ptli (21) 

z' £ «i c 1 i>i 

where e y £ u e y (c). Since, for all codes c | y,y, p(c \ y) = p(c,y' I 2 ) = 
p(c | y,y) p(y' I z), (21) becomes: 

E P( c I Z) G(Z,c) = E 1 Z) « ~ ^ E P( c I m0> 

c I Z Z' e C I Z-Z' 

and since £ /?(c | y,y) = 1, 

C I m' 

2 />(c I 25 G&c) = £ K2' I 25 «' <te2 ' ) , (22) 

c 1 1 Z' e ^ 

where we have eliminated all the branches that do not belong to y or to y', and hence do not 
contribute to the error event. When C represents the set of all time varying trellis codes asso- 
ciated with a particular topological trellis, the probability of choosing a signal on one branch 
during time interval [tT,{t+\)T\ does not depend on the signals chosen during previous or 

00 

future intervals. Then if t represents a time index, p(y | y) = n p(y t '\yt), where y{ is the 

t= 1 

signal on the i h branch of the path y, and p(y/\y t ) is the probability of having y t ' on that 
branch given y t . Normally, all channel signals are equiprobable and can be chosen indepen- 
dently, so that for y t ^y t ' P^Yt^t) = p(yf) = 1/A/, where M is the number of signals in the sig- 
nal constellation S. However, for the derivation of the bound channel signals do not have to be 
equiprobable. Finally, from (18), (19), (20), and (22) 

00 " P 

2 p(c) T afi (c) S 2 ?<25 2 n PWM . (23) 

ceC 2 yl e <? 2 t= 1 

Equation (23) holds for any trellis and does not depend explicitly on the encoder input width k 
or on the constraint length V 0 . The next step, what Forney calls configuration counting, con- 
sists of regrouping the paths y (respectively y) that correspond to the same topological paths Y 
(respectively Y). 
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Configuration counting: 

Given y, configuration counting regroups error events (y,y) of equal length X, and then 
regroups channel paths yf that correspond to identical topological paths Y. By definition, an 
error event (yytf) of length X starting at time 1 and corresponding to a topological error event 
(Y,Y) satisfies 

Y t *Y t ' for 1 < r < x 
Y t = Y; for x < f , 

which means that the branches are different for X intervals, but the signals can be the same. In 
other words, y t *= for 1 < t < x, but y t and y t ' can still be labeled with the same signal 
point. Then, if (y,y) is an error event of length X starting at time 1, 

fi P(y/W = n pCy/fi) e-^^. (24) 

t= 1 t= 1 


An error event lasts at least V + 1 information blocks. The input block of width k that 
feeds the encoder at time 1 differs for paths y and y. This different input block is shifted into 
memory, and it takes at least V time units to clear the memory of that difference. So two paths 
that diverge at time 1, cannot remerge before time V + 1. We define e^ x as the set of paths y 

oo 

in such that the error event fy,y) lasts x branches. Then, e^ x . Hence, com- 

T = V + 1 

bining (23) and (24), 


X P( c ) T a> p(c) < 

c e C 


Xp(y) 

i 


X 

|t = v + 1 


X 

i 


n 

t = i 


p(y t '\y t ) e 


- od](y h y l ') 


(25) 


where we regrouped error events of length X in e^, starting with the shortest error events of 
length v + 1. We now regroup paths y in e^ x that refer to the same topological path Y, but 
with different signals on the branches. We note by y — > Y the fact that Y is the topological 
path associated with y, i.e., that y is a labeled version of Y. We define E^ x as the set of 
topological paths Y associated with elements of e^ x : E^ x = ■{ 7' \ y Y, y e e ^ x \- . 
Therefore, 


£ fi p(y,'\y.) = £ £ fi f’O’/W 

Y e e v 1 = i e Y\Y~*Y f = i 


(26) 
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The choice of a signal on a branch y( does not depend on the signal labeling the branch y t , 
since different branches of a code are labeled independently. Therefore p(y t '\y t ) = p(y t '), and 
all topological paths of equal length x generate the same sum over all the channel assignments: 

Z n p(y,’ly,) = z n PW ( 27 ) 

i'\Y -+¥ t = 1 /e? i=i 

We call C^ x the cardinality of E^ z . C^ x is the number of topological error paths of 
length X diverging from Y at time 1. C^ x does not depend on the correct path y, so that 
x = C T . Hence, (26) becomes 

Z ri P<yt I y,) = C, Z n PM , (28) 

Y e t = 1 6 S' f=l 


and (25) gives 


X />(c) T’a.pOO ^ XK$ 

c € C 2 


z c, z n pw 

[x = v + 1 £ e 5? t= 1 


• (29) 


Using the inequality 


Z a i 

i 


P _ P 

< £ a,- for p < 1, we get 


X p( c ) r tti p(c) — X 2 pCx) C x 

c e C t=v+1^ 


X II p(y/) * 

/ 6 S' r = 1 


- ad\[y„yf) 


• (30) 


We now write p(y) = n p(y/): 
i = 1 


x p( c ) t^{c) < £ c x p x n pO 7 /) 

C E C T = V + 1 £ 1= l 



X 

y € 5 


p(y') e 


- a^OwO 


P 


(31) 


= x c x x n ii 

T = V + 1 2^=1 f = 1 


X p(y') e ~ a4o ' ,y) 

y € s 


(32) 


Now let us introduce the notation: 


H(y t ,S)± 2 

/€ 5 


(33) 
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H(y t JS) gives the position of the signal y t relative to the constellation S. Hence, 

IP 


2 n m n 

i 1=1 /= i 


2 jKjO 

y’e 5 


= X n p(yi) n yps ) 9 • ( 34 ) 

2 1 = 1 f=l 


Now we eliminate the branches of y that belong to y in the same way that we eliminated the 
branches of y that belonged to y. We calculate the right side of (34): 


x fi pw n h w) p = x n pw # 0 ^ n pw 


(35) 


i 1 = 1 

t= 1 

2 1 

= i 

/= 1 













r i 



= n 

X P(y ) H(y,S)P 

n 

X p(y) 



t = i 

ye 5 


>65 


(36) 


Since X p(y ) = 1, 

yeS 


x fi p(yi ) n #o^) p = n i p(y) H(yj)p 

y_ 1=1 t = 1 «=1 


X pOO //(y,5)p 

> € S 


(37) 


(38) 


Returning to equation (32): 


X P( c ) r a,p( c ) ^ X tf Q 

ceC T= V + 1 


X pOO H(y^) p 

>65 


(39) 


£ 

The cardinality C T of is C x = J“[ 2^ _v ^ + , where [x] + = a: if * > 0 and zero otherwise 

j=l 

[Calderbank-Mazo-Wei, 1985]. Hence for any t>v+l , C z = 2 kz ~ ks ' , where 
Vi = . . . = v* = v. 

According to Forney’s notation [Forney, 1974], we define 

|l/p 

, (40) 


£(a,P)= -In 

XpOO 


p' 


>65 

>'65 



where y and y belong to the signal constellation S, and p(y) is the probability of signal y. 
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A simpler form of (39), using the definition of £(a,p), is 

I p{c) T^ic) < £ C T P e-^ E ^ 


ceC 


x — v + 1 


(41) 


£ X 2** p xp£(a,p)_ 

t = v + 1 

We can now write a third statement that follows from statement 2 and (42): 


(42) 


Statement 3: 

For any set C of trellis codes of input width k and memory length v, 

e“P dl £ 2** - e -*%*.?) < i => □ c Q t=Cs.t.d free (c Q )>d. (43) 

T = V + 1 


Statement 3 is as tight as statement 2 when p = 1. We calculate the largest d such that ine- 
quality (43) holds. If E(a,p) > &ln2, the left side of (43) can be written as 

e apd 2 - *vp ln2 g _ - p[v£(a,p) - ad 1 ] L 


1 _ e ~ p[£(a,p) - *ln2] ' e p[£(a,p) - k\n2] _ 1 

The previous expression will be smaller than 1 (i.e., (43) is satisfied) if and only if 

d 2 < \ £ ( a ’P> _ 9 t £ ( a >P)] 


a 


a 


(44) 


where 


e [e(a,p)] = In [eP( £ (“-P)-^ - l] 1/p . 


(45) 
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This concludes the derivation of the bound. The bound can now be put in the form of 
Costello’s bound [Costello, 1974]. 


Theorem: 


Given a signal constellation S, there exists a (k, v) trellis code with minimum free Euclidean 
distance dp. ee such that: 



> max 


v 


£(a,p) > klnl'- 


0 < a 
OSpS 1 

p(y) 


Ej a , P) 

a 


er m , p) ] 

a 


(46) 


where k is the information block size, V 0 = kv is the constraint length, S is the signal constel- 
lation, a and p are parameters which optimize the bound, y and y' are signal points from the 
signal constellation S, d e (y,y) is the Euclidean distance between signals y and y , p(y) is a 


probability distribution on S, £(a,p) = - In 

L p<y) 

X Pb 

p ' 


_,€J 

y'eS 

* « 

* 


e[£(a,p)] = In (eP[£(a.P)-*to2] _ ^ 


IV APPLICATION OF THE BOUND 

Symbols y that label branches of the trellis can represent subsets of the constellation 
when the trellis contains parallel transitions. The subset y is composed of the signals assigned 
to a parallel transition. The encoder outputs sequences of subsets rather than sequences of sig- 
nals. This enables us to study the effects of uncoded bits, or, equivalently, the effects of map- 
ping signals onto a trellis with parallel transitions using Ungerboeck’s "set partitioning" method 
[Ungerboeck, 1982], In that case, if d miri is the minimum distance between points in the same 
subset, and S^ sets ^ the free Euclidean distance between sequences of subsets, then the overall 
minimum Euclidean distance of the TCM scheme is dfree = min <rf$ sets) , d^. This 
shows that parallel transitions limit the maximum achievable d^ ee of a TCM scheme. 

An evaluation of the bound corresponds to a trellis characterized by k and v 0 = kv and 
represents the average free distance of all the corresponding trellis codes (all labelings of the 
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trellis). However, since the bound is a random coding bound, it does not represent the best free 
distance achievable with a particular trellis, because optimum trellis codes may perform better 
than the average. 

Like the lower bounds on the Hamming [Costello, 1974 and Forney, 1974], this 
lower bound on the Euclidean is exponentially tight and varies linearly with Vq for large 
v 0 . Like any other "Gilbert type" bound, it is not expected to be tight for small V 0 . 

The bound depends implicidy on the constraint length Vq = kv and on the information 
rate per dimension R. Although only certain discrete values of V 0 and R are possible, we treat 

dfr ee ( v o) an d — — (R) as continuous functions. 

V USING THE BOUND TO EVALUATE TCM SCHEMES 
1) Variations of d^ ee with V 0 : 

Bounds on the free Hamming distance of binary convolutional codes have usually been 
expressed as functions of v 0 . We compare TCM schemes with different constellations at a 
given rate R and at a given average energy E^g, where E m g is the average signal energy of 
the constellation used in the TCM scheme. The lower bound on dp, ee increases with the con- 
straint length Vq, which means that any dp. ee can be obtained with a large constraint length. 
However the decoding complexity increases exponentially with the constraint length, so that 
practically only small constraint lengths will be implementable . 

* The effect of increasing the number of signals on dj- ree (same rate R, same dimen- 
sionality D, same average energy E avg ): 

Increasing the number M of signals increases dj- ree because it provides more flexibility 
in the choice of signals assigned to trellis branches (flexibility in the mapping). Signals with 
greater distance can be chosen on the branches of error paths given a correct path which 
increases dp. ee . Note that increasing M, while keeping k constant, lowers the binary code rate 
Rb, which is consistant with the fact that dp. ee increases. Fig.2 shows PAM schemes with 
different numbers of signals (M = 4, 8, 16, 128). Increasing the number of signals provides 
more gain for small constraint lengths V 0 than for large constraint lengths. In Fig.2, going 
from 4-PAM to 8-PAM or even 16-PAM provides most of the gain and higher order PAM 
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schemes do not improve dp. ee significantly. Note that the gain in d^ ee may not be worth the 
added modulation complexity. 

* The effect of increasing the signal set dimensionality on d^ ee (same rate R, same 
number of signals M, same average energy E mg ): 

Higher dimensional schemes can be obtained by using a basic D-dimensional constella- 
tion for several transmission intervals [Wei, 1985] and [Lafanechere-Deng-Costello, 1987]. 
Fig.3 shows L-dimensional schemes for which the constellation is constructed from L uses of 
4-PAM (1 -dimensional). This construction has the advantage of retaining the distance proper- 
ties of 1- or 2-dimensional constellations so that the only parameter of the TCM scheme that 
varies is D. For a given constraint length V 0 , higher L-dimensional schemes yield a larger 
dp. ee , and as Vq increases the curves diverge and the gain goes to infinity. However, the error 
coefficient increases with dimensionality [Forney et. al., 1984], which may cancel the gain 
from the increased dp. ee at moderate decoded bit error rates. 

* The effect of changing the modulation scheme on df ree (same dimensionality D, same 
number of signals M, same average energy E avg , same rate R ): 

Changing the modulation scheme affects dp. ee . For example, rectangular constellations 
yield a larger d^. ee than constant envelope constellations. Fig.4 shows M-PSK vs. M-QASK. 
This suggests that a dp. ee penalty is associated with constant envelope modulation. Note that 
this penalty increases with the constraint length. Furthermore, the bound gives a means of 
optimizing some parameters associated with a specific type of constellation without searching 
exhaustively for the best constellation. 

* The effect of uncoded bits or parallel transitions on dp. ee when the signal constellation 
is "mapped by set partitioning": 

Uncoded bits generate parallel transitions in the trellis and limit dp. ee , since df ree cannot 
be larger than the minimum distance between parallel transitions (the minimum distance within 
the subsets created by "set partitioning"). Fig.5 shows that for 8-PSK, 1 uncoded bit increases 
dfree f° r small constraint lengths. For small Vq, this agrees with the best known codes con- 
structed [Ungerboeck, 1982], For large V 0 , however, we see that 1 uncoded bit limits the 
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achievable d^. The bound shows whether uncoded bits combined with mapping by set parti' 
honing will increase dp. ee for small constraint lengths v 0 . 


2) Comparison of the lower bound on dp. ee with upper bounds [Calderbank-Mazo-Wei, 1985 
and Pottie-Taylor, 1986] and known codes [Ungerboeck, 1982]: 

Upper bounds are expected to be tighter than the lower bound for small constraint 
lengths, and the lower bound is expected to be tighter for large constraint lengths (this is analo- 
gous to the way bounds on binary convolutional codes behave). The lower bound is a 
"Chemoff type" bound and therefore is exponentially tight when v 0 is large. Fig.6 indeed 
shows that upper bounds are tighter than the lower bound for small constraint lengths. But the 
lower bound becomes tighter as v 0 increases. The slope of the lower bound gives a precise 
indication of the asymptotic rate of increase in d^. ee . Finally, the lower bound guarantees the 
existence of codes that can achieve a certain dj- ree . 


3) Asymptotic behavior: 


dfree 

v 0 


(JR) 


For a given constellation, the achievable dp. ee / v 0 is larger for small R than for large 
R. Graphs of (R) give a means of comparing the asymptotic behavior of df ree for 


different constellations. Several examples are shown in Fig.7. The largest achievable rate R 
depends on the constellation and its dimensionality : it is (/? i ,log 2 M) ID for a code rate R^. 
These largest achievable rates appear clearly in Fig.7. Rectangular lattices constructed from 
the same 1 -dimensional constellation (for example 4-PAM in Fig.7) yield the same graph, 
which means that these rectangular lattices have the same asymptotic behavior. In other words 
the curves dj- ree (v 0 ) are parallel for large v 0 . The lower bounds are compared with an asymp- 
totic upper bound [Calderbank-Mazo-Wei, 1985] in Fig.7. 


CONCLUSION 

We have obtained a random coding lower bound on dp. ee . It takes into account the 
actual Euclidean distances between points of the channel signal constellation. As with any 
"Gilbert" bound [Gilbert, 1952] it is most useful as an asymptotic bound. It provides a means 
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of comparing the asymptotic performance of different modulation schemes and shows that the 
free Euclidean distance increases linearly for large constraint lengths. However, even for short 
constraint lengths, the bound behaves similarly to the best known codes and can be used as a 
preliminary calculation when designing TCM schemes to help select the most promising 
schemes. 
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Figure 1: Trellis Coded Modulation Scheme. 
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Figure 5: Trellis coding and uncoded bits (Ungerboeck) for 8-PSK, 
R = 1 bit! dim, E avg = 1 J/dim. 
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Figure 6: Comparison of the lower bound on dp. ee with upper bounds 
(CMW and PT) and with the best known codes for 8-PSK, 
R = 1.5 bits! dim, E avg = 1 Jtdim. 
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Figure 7: Asymptotic bounds vs. rate, E avg = 1 JlcLim. 
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BANDWIDTH EFFICIENT CODING ON A FADING CHANNEL 

Christian Schlegel and Daniel J. Costello, Jr. 
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Notre Dame, Indiana 46556 

INTRODUCTION 

Communication over real world channels, particularly mobile satellite channels, suffers from 
nonuniform disturbances that cannot be described by additive white Gaussian noise. One 
important such disturbance is the multiplicative noise arising from multipath reception 
at the receiver (see Hagenauer [1]). Communication channels suffering from multipath 
reception are known as Rayleigh/Rician fading channels. If Si(t) is a signal that is sent over 
a fading channel, then the received signal r,(t) is given by 

r,*(<) = asi(t) + n(t), (1) 

where n(t ) is additive white Gaussian noise and a is the fading amplitude whose probability 
distribution is given by 

p(a) = 2a(l + K)e- K ~ a2 ( 1+K )l 0 (2a^K(l + K)) , (2) 

where Iq denotes the modified spherical Bessel function. K is the ratio of the signal energy 
received on the direct path to the energy received via reflected paths (compare [1], [2]). 

Due to the increasing density of communication systems most digital channels are power 
as well as bandwidth limited. Ungerboeck [3] has introduced bandwidth efficient trellis 
coding which provides a coding gain of up to 6dB on additive white Gaussian noise channels. 
In this paper we look at the performance of these bandwidth efficient trellis codes on 
channels plagued by fading. We will limit our attention to the Rayleigh fading channel, i.e., 
K = 0 in (2). 


CUTOFF RATE FOR RAYLEIGH CHANNELS 

Since a great variety of signal constellations can be used on the Gaussian channel, it is 
instructive to look at the cutoff- rates of these constellations on a fading channel. The cutoff 
rate Ro is the exponent in the random coding bound, Pg < 2~ NR °, which gives a bound 
on the average block error probability of all codes of length N. The importance of R 0 
transcends its use in the error probability bound, as shown by Massey [4]. We assume that 
the decoder is given not only the received signal r;(t), but may be given an estimate of the 
fading amplitude a during each signal interval. We consider two cases: complete and no side 
information, i.e., the receiver either knows the exact value of a or has no information about a. 
We further assume a slowly varying fading amplitude a, which remains essentially constant 


1 


4 
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over a signaling interval. Also, the signals Si(t) are interleaved so that the amplitude fades 
affecting successive symbols are independant. 

Without Side Information 

If the decoder has no information about a, R 0 is given by 

Ro = — log 2 min ( (1 — y/irXc erfc(Ac)e Ac )j , (3) 

A Vjt=i 1=1 ) 

where A is the signal alphabet size, and Xj. are the signals, pk and pi axe the probabihties 
with which they are chosen, N 0 is the one sided spectral noise power density, A is the 
Chemoff parameter and c = s_ 2 k — Equation (3) is too messy to interpret in the sense 

that a design criterion can be deduced from it. A computer evaluation shows interesting 
differences between different constellations. It is most noteworthy that the rectangular 
signal constellations have a significantly poorer Rq than the constant envelope signal sets, 
while they have a slightly better Rq on the Gaussian channel. 

With Side Information 

Here we consider a decoder that is given perfect knowledge of the fading amplitude. Such 
information can partly be extracted from the received power of a pilot tone, which is needed 
anyway in order to synchronize the phase. In this case R 0 is given by 

/ A A \ 

R 0 = - log 2 £ f - )2 • (4) 

\k=u=i 1 + / 

This equation is considerably less complex and provides a key to code design for fading 
channels, as we shall see in the next section. 

TRANSFER FUNCTION BOUND 

Any decoding error in a trellis code is characterized by a set of incorrect branches which 
the decoder chooses over the correct branches. With each incorrect path we may associate 
a sequence of incorrect trellis states S'/., while the sequence of correct states is Sk- Any 

A /V 

error event of length l can then be described by / state pairs, (S 0 , So), . . . ,(Si, Si), with 
So = So, Si = Si, and Sk ^ Sk for 0 < k < l, i.e., the incorrect path must not touch 
the correct path. Associated with these paths are two symbol sequences a = [a 0 , a*, . . . , a/] 
and a = [a 0 ,di, . . . ,S/]. The probability of an error event may be upper bounded by the 
Chernoff bound on the two codewords which the paths generate. We may therefore write 

Pr[(So Si) - s,)l < C( a, a) = ff C(a h aj) (5) 

j-o 

where C(aj,a,j) is the Chernoff bound on the two signals a,j and a r The last equality is a 
consequence of the interleaving process. The event error probability is now upper bounded 
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( 6 ) 


by the sum of the probability of all incorrect paths, i.e., 

n C( a jky a jk), 

error o 
paths k 3 

where d k is the length of path k. Let us now contrast the Chemoff bound for the Gaussian 
channel with that for the fading channel. In the Gaussian case 

I! C(a jk ,a jk ) = e^o (7) 

j=0 

i.e., the error probability is determined by the total Euclidean distance of each error path. 
In the fading case with side information we have 


dk — 1 

J"J Cfajk, CLjk) 
j=0 


dk - 1 

n 


j- 0 


1 


1 _L ( a Tfc —&jk) 2 

1 ^ 4 N 0 


" nfe 1 fo-t-a*)*’ 

a ]k*‘ a jk 


( 8 ) 


where d' k equals d k , the length of the path less the number of matches, i.e., the number of 
branches where a,j k = dj k . We call d' k the effective length of path k and dl — min(dfc) the 
effective length of the code. The approximation is valid for SNR values > 6dB, which is 
usually the case for transmission over a fading channel. The signal to noise ratio, or SNR, 
is defined in the usual way as SNR = \og 10 (Eb/N 0 ), where is the average signal energy of 
the signal set in question, i.e., E k = '£ J f=\Pi x }- R becomes clear that the error performance 
in the fading case is dominated by the paths with the shortest effective length, i.e., d' k = d! 
and among those by the one having the smallest product in the denominator, i.e., with the 
smallest product distance. While preparing this paper, it has come to our attention that 
Simon & Divsalar [5] have come up with a similar interpretation. 


CODES FOR THE FADING CHANNEL 


In this section we examine the performance on a fading channel of bandwidth efficient codes 
that were designed for the Gaussian channel and of some new codes designed for a fading 
channel. From equation (8) it is seen that parallel transitions in the code (d 1 = 1) are most 
detrimental to code performance and should be avoided. We now concentrate on trellis 
codes without parallel transitions. Ungerboeck [3] gives a list of optimal trellis codes based 
on convolutional codes. His list is given in table 1, together with their effective lengths d! 
and minimum product distance mpd. Table 2 gives a list of codes that will outperform the 
Ungerboeck codes on a fading channel. Codes in the two tables have to be compared on the 
basis of equation (8), i.e., the dominant parameter is d' since it determines the slope of the 
error bound. Among codes with equal d', the one with larger mpd is superior. For example, 
for the v = 6 code the coding gain at an error probability of 10 -5 is 8 dB. All the codes 
use an 8PSK signal set. We have seen in section II that, unless perfect side information 
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is available, constant amplitude schemes are preferable. It also seems unlikely that larger 
signal sets that 8PSK will be used in the near future due to the problem of synchronizing 
the phase properly on fading channels. 
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Table 1: Ungerboeck’s 8PSK codes Table 2: New codes 

CONCLUSIONS 

We conclude that bandwidth efficient coding is feasible on fading channels, but the impor- 
tant design parameter is no longer the Euclidean distance but two parameters, the effective 
length and the minimum product distance. Intuitively a good code in a nonuniform noise 
environment should try to spread the ” distance” between two codewords evenly over all 
the branches over which the codewords differ. The same basic idea is used in diversity 
transmission, which is usually employed to improve performance on fading channels. 
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